Predicting citation count of Bioinformatics papers within four years of publication
نویسندگان
چکیده
MOTIVATION Nowadays, publishers of scientific journals face the tough task of selecting high-quality articles that will attract as many readers as possible from a pool of articles. This is due to the growth of scientific output and literature. The possibility of a journal having a tool capable of predicting the citation count of an article within the first few years after publication would pave the way for new assessment systems. RESULTS This article presents a new approach based on building several prediction models for the Bioinformatics journal. These models predict the citation count of an article within 4 years after publication (global models). To build these models, tokens found in the abstracts of Bioinformatics papers have been used as predictive features, along with other features like the journal sections and 2-week post-publication periods. To improve the accuracy of the global models, specific models have been built for each Bioinformatics journal section (Data and Text Mining, Databases and Ontologies, Gene Expression, Genetics and Population Analysis, Genome Analysis, Phylogenetics, Sequence Analysis, Structural Bioinformatics and Systems Biology). In these new models, the average success rate for predictions using the naive Bayes and logistic regression supervised classification methods was 89.4% and 91.5%, respectively, within the nine sections and for 4-year time horizon. AVAILABILITY Supplementary material on this experimental survey is available at http://www.dia.fi.upm.es/~concha/bioinformatics.html CONTACT [email protected]
منابع مشابه
Citation Analysis of the Most Influential Publications in Travel Medicine
Introduction: Citation analysis reflects the extent to which published work has been recognized in the scientific community. The purpose of this study was to characterize the most cited publications in travel medicine.Methods: Travel medicine articles indexed on Scopus which had been published in the English language through 2016 were retrieved independen...
متن کاملThe online attention to certain nuclear medicine topics: An altmetrics study vs. a citation analysis
Introduction: Traditional citation analysis has been greatly criticized because the process of citation accumulation requires considerable time after publication. So, the term “altmetrics” was proposed in 2010 to measure the scientific and social impact of a paper.We performed a search for certain nuclear medicine topics using the altmetrics approach to report the correlation b...
متن کاملمقالههای بینالمللی پراستناد علوم پزشکی کشور در پایگاه اسکوپوس: 2010 تا 2014
Introduction: Scientific output of the Islamic Republic of Iran in Medical Sciences has been increased during recent years as reflected in Scopus database. Moreover, highly cited papers of researchers, institutions and countries can be used in order to study the citation impact and quality of scientific output. The current study investigates the quality of Medical Sciences’ scholarly outp...
متن کاملMisconduct in Research and Publication: a Dilemma That Is Taking Place
Having considered current reports concerning plagiarisms taking place in the global science community, the authors decided to address the principal reasons, which lead to these illegalities. In recent years, misconduct in research, such as plagiarism, fabrication, falsification, guest author, ghost author, self-citation, etc. have been increasing significantly in scientific papers, proving a la...
متن کاملOn Modeling and Predicting Individual Paper Citation Count over Time
Evaluating a scientist’s past and future potential impact is key in decision making concerning with recruitment and funding, and is increasingly linked to publication citation count. Meanwhile, timely identifying those valuable work with great potential before they receive wide recognition and become highly cited papers is both useful for readers and authors in many regards. We propose a method...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Bioinformatics
دوره 25 24 شماره
صفحات -
تاریخ انتشار 2009